
Class detection works for huggingface checkpoints #1800

Merged — 2 commits merged into keras-team:master from from-preset-improvements on Aug 28, 2024

Conversation

@mattdangerw (Member) commented Aug 27, 2024

Fixes #1798. You can now do `keras_nlp.models.Backbone.from_preset("hf://google-bert/bert-base-uncased")` with a safetensors checkpoint, and we will find the correct architecture to instantiate. This has always worked for Keras-style checkpoints, but not safetensors ones.
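A minimal usage sketch of the newly working path (assuming a recent `keras_nlp` install; the handle is the one from the example above):

```python
import keras_nlp

# With this change, architecture detection also works for safetensors
# checkpoints: the loader inspects the Hugging Face config, resolves the
# matching KerasNLP backbone class, and converts the weights.
backbone = keras_nlp.models.Backbone.from_preset(
    "hf://google-bert/bert-base-uncased"
)
print(type(backbone).__name__)  # Expect the BERT backbone class.
```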

This was a tricky one to fix, and it involved some large refactoring of our preset loading routines.

Originally the intent was that `from_preset()` would be an easily readable bunch of lower-level Keras calls. With the arrival of transformers conversions, and soon timm conversions, I think that goal is no longer realistic. Instead I added a loader interface, with default implementations of `load_task` and `load_preprocessor` (see the sketch after the list below). Every format we support directly converting from has to support, at a minimum:

  • Detecting the backbone class.
  • Loading the backbone class.
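
As a rough illustration, the loader interface might look something like the following. Only `load_task` and `load_preprocessor` are named in this PR; the other method names (`check_backbone_class`, `load_backbone`) and all signatures are hypothetical.

```python
# Hypothetical sketch of the loader interface, not the merged code.
class PresetLoader:
    """Loads models from one checkpoint format (Keras, transformers, ...)."""

    def __init__(self, preset, config):
        self.preset = preset
        self.config = config

    def check_backbone_class(self):
        """Detect which backbone class this checkpoint maps to."""
        raise NotImplementedError

    def load_backbone(self, cls, load_weights, **kwargs):
        """Instantiate the backbone, converting weights if needed."""
        raise NotImplementedError

    def load_task(self, cls, load_weights, **kwargs):
        # Default implementation: build the task around a loaded backbone.
        backbone = self.load_backbone(cls.backbone_cls, load_weights)
        return cls(backbone=backbone, **kwargs)

    def load_preprocessor(self, cls, **kwargs):
        # Default implementation: build the preprocessor (e.g. tokenizer
        # plus packing) from the preset's assets.
        return cls(**kwargs)
```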

One consequence of this work is that every class with a `from_preset` constructor needs to reference the `backbone_cls` it matches with. I think this will be a more stable way to handle our "auto class"-like functionality as we venture further towards multi-modal models.
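
Concretely, the pairing could look like this. The stub class names and the lookup helper are illustrative only; the mechanism is the `backbone_cls` attribute described above.

```python
# Illustrative stubs showing the pairing mechanism.
class BertBackbone:
    pass

class Tokenizer:
    backbone_cls = None  # Concrete subclasses override this.

class BertTokenizer(Tokenizer):
    # The loader routes a detected backbone class to the matching
    # tokenizer/preprocessor/task via this attribute.
    backbone_cls = BertBackbone

def find_tokenizer_cls(backbone_cls):
    # "Auto class" style lookup: match a detected backbone class to the
    # tokenizer subclass that declares it.
    for cls in Tokenizer.__subclasses__():
        if cls.backbone_cls is backbone_cls:
            return cls
    return None

assert find_tokenizer_cls(BertBackbone) is BertTokenizer
```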

@mattdangerw force-pushed the from-preset-improvements branch 3 times, most recently from 893fad0 to c2fe6e8 on August 27, 2024 02:35
@SamanehSaadat (Member) left a comment
LGTM! Thanks, Matt! I think the new design looks very nice!

(Review thread on keras_nlp/src/utils/preset_utils.py — outdated, resolved)
@mattdangerw merged commit 0c04abe into keras-team:master on Aug 28, 2024
8 of 10 checks passed
Successfully merging this pull request may close these issues:

- from_preset issues for huggingface/transformers checkpoint converters (#1798)